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In Lyman-a forest measurements it is generally assumed that quasars are mere background light 
sources which are uncorrelated with the forest. Gravitational lensing of the quasars violates this 
, assumption. This effect leads to a measurement bias, but more interestingly it provides a valuable 

' signal. The lensing signal can be extracted by correlating quasar magnitudes with the flux power 

spectrum and with the flux decrement. These correlations will be challenging to measure but their 
O . detection provides a direct measure of how features in the Lyman-a forest trace the underlying mass 

density field. Observing them will test the fundamental hypothesis that fluctuations in the forest 
are predominantly driven by fluctuations in mass, rather than in the ionizing background, helium 
reionization or winds. We discuss ways to disentangle the lensing signal from other sources of such 
correlations, including dust, continuum and background residuals. The lensing-induced measure- 
ment bias arises from sample selection: one preferentially collects spectra of magnified quasars which 
are behind overdense regions. This measurement bias is ~ 0.1 — 1% for the flux power spectrum, 
. optical depth and the flux probability distribution. Since the effect is systematic, quantities such as 

^ ' the amplitude of the flux power spectrum averaged across scales should be interpreted with care. 

6 

ti. 

C/5 , I. INTRODUCTION 



o 



o 



> 



PACS numbers: 98.80.-k; 98.80.Es; 98.65.Dx; 95.35. -|-d 



Light from distant galaxies and quasars is gravitationally lensed by mass along the hne of sight. For a flux-limited 
survey of quasars, lensing magnification biases the observed number density llH3|. For example, a positive mass 
fluctuation along the Une of sight can increase the apparent sky area and the observed flux. The geometrical area 
increase decreases the quasar number density, while the flux increase promotes intrinsically faint objects above the 
magnitude threshold increasing the source density. Together these effects introduce a correction to the quasar number 
density called magnification bias. Detections of the magnification of distant quasars by low-redshift galaxies confirm 
the presence of this effect (e.g. 
. Many authors have studied the effect of lensing magnification on observations of the galaxy correlation function and 
' power spectrum. Another way of inferring the large-scale mass distribution is through measurements of the neutral 
Hydrogen density. As light from distant quasars passes through clouds of neutral Hydrogen, photons with rest- 
frame frequency at the Lyman-a transition (1216A) are absorbed. The observed quasar spectrum contains troughs 
corresponding to absorption by neutral Hydrogen at redshift z = I'a/ii^ohsi^ + ^^pec)) — 1, where Vohs is the observed 
' frequency of the trough, Vpoc is the peculiar velocity at z (speed of light is set to unity), and i/q. is the Lyman-a 
H I frequency.^ This Lyman-a forest (of absorption features in the quasar spectrum) is a cosmological tool that can be 
. - - 1 used to probe the neutral Hydrogen along the line of sight (see for example and references therein). Here we ask 
two questions: what is the effect of gravitational lensing on measurements of the Lyman-a forest? And how can we 
extract the gravitational signal from such measurements? 

Let us first discuss qualitatively what we expect to happen. Consider a single line of sight towards a quasar at 
comoving distance XQ- Lensing magnification changes the observed quasar flux /, but in a frequency (i/) independent 
way 

f{iy)^f{iy)-KxQ) (1) 

where fJ-{xQ) is the magnification which depends on the density fluctuations along the line of sight. Since all frequencies 
are treated in the same way, fluctuations in the flux as a function of frequency are unaffected — just as if we were 



^ Here, the redshift z refers to the intrinsic or cosmological redshift, i.e. the redshift if the absorbing materials were comoving. The 
observed redshift and the intrinsic redshift are related by 1 -|- z^hs = (1 + 2)(1 + fpcc)- 
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looking at an intrinsically brighter or dimmer quasar. Measurements of for example, the flux power spectrum, from 
an individual line of sight are unchanged by magnification. On the other hand, magnification does change which lines 
of sight are observed. More precisely, a selection bias is introduced: more (fewer) quasars are observed along lines of 
sight that lead to a positive (negative) magnification correction to quasar number counts. Let n{xi ^) be the number 
density of quasars at comoving distance x in direction 9. Under the effect of lensing magnification, 

in lU j ame[m)n}'['m) 

Here, rPirn) is the luminosity function of quasars at magnitude m (i.e. dmnP{m) is the number density of quasars at 
magnitude to ± dm/2), and e(rn) quantifies the sample definition - the simplest example is a step function which cuts 
off all quasars fainter than some limiting magnitude miim., in which case s reduces to the more familiar expression 
s = dlogiQN{< TOiim.)/d"T-iim., whcrc iV(< TOiim.) is the total number of quasars brighter than TOum.. This is a 
well known result, but a derivation is summarized in Appendix]^ With lensing magnification, the lines of sight we 
observe do not constitute a fair sample of the density field. Therefore, while lensing magnification has no effect on 
measurements of the flux fluctuation from a single quasar line of sight, measurements dependent upon the density held 
averaged over multiple quasars will be biased. This occurs since the observable (e.g. the neutral hydrogen density) is 
correlated with the magniflcation which depends on the gravitational potential along the line of sight. 

What is perhaps the more interesting question is how we could extract the lensing signal from observations of 
the forest. The discussions above make clear gravitational lensing affects both the observed brightness and number 
density of quasars. One could imagine cross-correlating these quantities with the Lyman-alpha forest observables. We 
will consider several possibilities and identify the ones with an interesting signal-to-noise. 

The rest of the paper is organized as follows. In miA\ we develop a simple description of the effect of lensing 
magnification on measurements of the Lyman-a forest, the key result is Eq. ([5]). In mi B\ - mi El we present the effect 
of lensing bias on the flux power spectrum, the effective optical depth and the flux probability distribution (PDF) 
determined by applying a biased weighting scheme to mock absorption spectra from simulations and compare these 
results with analytical estimates in terms of the flux-mass polyspectra. In mill we discuss the exciting possibility 
of observing the lensing-induced correlation between quasar magnitudes and the flux fluctuations, the flux power 
spectrum or the mean flux. While this work is dedicated to discussing magnification bias, dust along the line of sight 
would have a similar (though generally opposite signed) effect, this is discussed in mil El We present concluding 
remarks and discuss the implications of this work for existing and future Lyman-a measurements in i^IVI Appendix 
\X\ contains a unified derivation of the magnitude-forest correlations and the lensing bias discussed in the paper. In 
Appendix |B] we discuss some issues regarding large-scale flux-mass correlations measured from our simulations and 
the accuracy of an analytic description. Appendix [Cl derives an estimator for the flux- magnitude correlation and its 
associated error. 

Before proceeding we should mention some related literature. The magnification bias to the statistics of metal 
absorption lines in quasar spectra was investigated in and a method for detecting statistical lensing by absorbers 
was proposed in Q . Lensing effects on the statistics of damped Lyman-a systems and the inferred density of neutral 
Hydrogen was considered in [l^. More recently, [ll| studied magnification bias due to intervening absorbers in the 
2dF quasar survey and [l^ in SDSS. Recent work by [Tsl . IT^ proposed correlating lensing in the cosmic microwave 
background with fluctuations in the forest to extract the flux-mass correlation. 



II. LENSING AS NOISE/BIAS 
A. Formalism 



We are interested in some Lyman-alpha forest observable O, which can represent the flux power spectrum, the flux 
transmission/decrement, the flux fluctuation (around its mean), and so on. Let us use Oi to denote the observable 
measured from a quasar labeled by /. We typically form an estimator by averaging over quasars: 

Oo.. = (3) 

where wj denotes weights, the simplest example of which is wj — I. 

Two important points. First O/, the observable on a quasar by quasar basis, is generally not affected by gravitational 
lensing. This is because gravitational lensing affects all wavelengths equally. For instance, Oj could represent the flux 
transmission which is ///c (where / and fc are the flux and continuum respectively as a function of frequency), or 
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the flux fluctuation Sf = {f — f)/f (where / is the flux averaged for the particular hne of sight in question). Since 
gravitational lensing brightens or dims /, / and fc all equally independent of wavelength, there is no effect on Oj. 

The other important point is that Cobs is generally affected by lensing. The crucial observation is that wj in Eq. 
([5]) reveals only part of the weighting that is going on; in any given dataset, we inevitably give zero weights to quasars 
which are too faint to observe. This means that even if one chooses w/ = 1, one is merely performing a straight 
average over one's sample, as opposed to an average over all possible quasars. To account for this fact, it is helpful to 
pixclizc the sky, and rewrite Cobs as 

Cobs = — [V 

l^i WiUi 

where i is the pixel label. One could conceptually think of each pixel as sufHciently small that the number of quasars 
in it Tii is either 1 or 0; we will more generally think of rii as simply the number density of quasars in pixel i. This 
way of rewriting is useful because it makes explicit the fact that Cobs is a weighted average - it is weighted by the 
number density of quasars, on top of whatever additional weighting Wi one might wish to apply. In other words, since 
our Lyman-alpha forest observable can be measured only if there exists a background quasar in the sky location of 
interest, the observable is always implicitly weighted by the abundance of quasars. 
Eq. dH tells us that 

=<"*™'^(l + (5f) (5) 
where 77,™*''™^!'= is the intrinsic pre-lensed quasar number density, and (5f is the lensing modification given by 

5f ^ (5s - 2)k, , - dj('^^^^x!^l(p{x: , i) (6) 

Jo XQ 

assuming fluctuations are small. The expression for the convergence Ki assumes spatial flatness, but can be easily 
generalized to non-flat universes {xq is the comoving distance to quasar). Here, 1 + 2k is the weak lensing approxi- 
mation to the lensing magnification /i of Eqs. ([1} and (HI (the full nonlinear expression is given in footnote [7]) . Using 
= 1 in Eq. (|4]), assuming there is no correlation between the forest observable and the intrinsic quasar number 
fiuctuation, and ignoring corrections of the integral constraint type (see Appendix Rl and for example, |15|). we obtain 
the following ensemble average, to the lowest order in fluctuations: 

(Cobs) - Otruc + {0^5l') (7) 

where {OiS^) is the correlation between the observable and the magnification fluctuation at the same i (i.e. zero lag). 
Dropping the i label, the lensing induced measurement bias on an observable O is therefore 

(Gobs) - Otruc = {05^^) = (5.S - 2){Ok) . (8) 

This is a fundamental expression that we will use repeatedly below. The meaning of each of these symbols is as 
follows: O is the fluctuation which we attempt to measure with the estimator Cobs, which upon ensemble averaging 
generally differs from the true underlying value Otruo = {O) . 

It is important to emphasize the actual measurement bias could be different. The above estimate assumes that all 
quasars within one's sample are weighted equally (i.e. Wi = 1). In practice, one might want to weigh brighter quasars 
within one's sample more strongly. For instance, one could weigh by the net flux of the quasar, which corresponds to 
inverse variance weighting in the noise dominated regime. This would result in a lensing induced measurement bias 
of (see Appendix 1X1 for derivation) 

(abs)-Otruc = (5.s'-2)(OAi). (9) 

where s' is defined as 

1 /dme(m)(dnO/dTO)10-'"/2-5 
* ~ In 10 / dm e(m)n"(77i)10-™/2.5 (1^) 

which can be contrasted with s defined in Eq. For a step function sample definition e, s' reduces to dlogiQ N'{< 
TOiim.)/d"T-iim. + 0.4, where N' = /T^" dmn°lO~"^^^^^ . This is generally larger than s. 

The precise value of s or s' is sample dependent. The relatively low rcdshift (1 < z < 2.2) Sloan Digital Sky 
Survey (SDSS) quasars that were used to measure the magnification-galaxy cross-correlation have a slope that spans 
— 1 < 5s — 2 < 1.9, depending on the magnitude cut Q. For this paper, the higher rcdshift quasars for which the 
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Lyman- alpha forest falls within the SDSS spectral range are more relevant. In Fig. [7] (in Appendix R)) we show the 
cumulated number counts and rough estimates of s and s' from SDSS data release 6. 

In the rest of this paper, we will adopt s = 1 (or s' = 1), so our results for the lensing biases can be scaled up and 
down by (5s - 2)/3 or (5s' - 2)/3. 

We will also adopt the following set of cosmological parameters in presenting numerical estimates. We assume a 
flat ACDM cosmology with cosmological parameter values ilm = 0.3, Ha = 0.7, ilb = 0.04 for the fractional densities 
in matter, vacuum and baryons; scalar spectral index = 1; Hubble parameter today Hq = lOO/i km/s/Mpc with 
h — 0.7 and fluctuation amplitude cts = 0.9. The speed of light is set to unity. In a few places we use analytic 
calculations of the matter and baryon power spectra. The 3-D matter power spectrum is calculated using the transfer 
function of [l6j with a modified shape parameter F = fi^/iexp [— r2f,(l -|- -s/STi/rim)] [T^j, and the non-linear evolution 
is modeled according to [l^- To relate baryons to mass we assume the 3-D Fourier space fluctuations are related by 
(5f,(k) = exp [— fc^/fcg](5(k), where kg = A;j-\/l0/3 and fcj is the Jeans scale (see Appendix [B] for more details). 

B. Lensing Bias from Simulations 

Lensing magnification biases measurements of the forest so long as the forest observable is correlated with the 
lensing convergence (Eq. ([8])). Since features in the Lyman-a forest are caused by absorption by gas which itself 
traces gravitational potentials, there ought to be a correlation between absorption features and the lensing convergence. 
However, the mapping between absorption-induced fluctuations in the flux and the gravitational potential is nonlinear 
and dependent on potentially uncertain physics, such as fluctuations in the ionizing background. For this reason we 
need to appeal to simulations to quantify the correlation between forest observables (e.g. 5/ and -P//) and the lensing 
convergence. 

In this section we use hydrodynamic simulations to study the effect of magnification bias on the Lyman-a forest. 
Specifically, we study the "D5" simulation of [l^. This simulation was run using an entropy-conserving (20| version 
of the smoothed particle hydrodynamics (SPH) code Gadget [2l|. It tracks 324"^ dark matter particles and 324'^ gas 
particles in a simulation volume of co-moving side length L = 33.75 Mpc/fe, and includes a subresolution model for 
star formation 12211 and heating by a uniform background radiation field |23| . Further details regarding the simulation 
can be found in [19[. We generate mock Lyman-alpha forest spectra along random lines of sight through the simulation 
volume in the usual way, integrating thr oug h the SPH kernels of the particles, and incorporating the effect of peculiar 
velocities and thermal broadening (e.g. [Ij]). We generate mock spectra from simulation snapshots at z = 2, 3 and 4. 
In each case, we adjust the amplitude of the photoionizing background in the simulation to match the observed mean 
transmitted flux from [2^. We also make use of the "G6" simulation (see [1^), which tracks 2 x 486'^ particles in a 
L = 100 Mpc/ft- box, in order to check the sensitivity of our results to finite boxsize effects in Appendix [B] We have 
checked that the flux power spectra from these simulations approximately match the existing measurements. 

Magnification bias biases which lines of sight an observer will sec; sight lines with positive fluctuations in mag- 
nification are more abundant and those with negative less abundant. With simulations we can of course see every 
line of sight (a mock spectrum is generated regardless of whether there is a quasar behind it) so we mimic lensing 
bias by weighting each line-of-sight by 1 -I- S^{6). The simulations are in boxes of size 33.75 Mpc/h, so we can't 
actually calculate the lensing convergence k along the entire line-of-sight but instead for the weighting of absorption 
measurements at redshift z we use^ 

Q — rx^^ 

WQ = l + 5^{e)^l + -Hln„,{^s-2)^^^^x{l + -z) dx'S{x\e) (11) 

^ XQ Jx 

where L is the length of the box x = x{z), XQ is the distance to the quasar, 6 is the mass fiuctuation, and we have 
used the approximation \7\(f> ~ V^(/) = |i/QS7m(l + z)S which should be valid under the line-of-sight integral. Since 
the bias only comes from the mass fluctuations S which are correlated with flux fiuctuations Sf at the redshift we are 
considering, and correlations drop off rapidly with increased spatial separation, neglecting contributions to k from 
mass fluctuations at lower redshifts shouldn't be a bad approximation. We will verify this below by comparing the 
lensing bias thus obtained with that obtained from another method. 



^ While it would be more aceurate to leave the x terms inside the integral, their amplitude is slowly varying across the length of our 
simulations box so Eq. mil l shouldn't be a bad approximation. 
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FIG. 1: (a) The ID flux power spectrum measured from the simulations. Panels (b) - (d) show the fractional correction from 
magnification ((Pf/obs) — Pf f true) / Pf ftmo for mean redshifts z = 2,3,4 respectively. The error bars are suppressed for clarity. 
The amplitude of the correction depends on the redshift of the quasar zq through k (Eq. ([6}) and also on the slope of the 
quasar number count function, here we assume 5s — 2 = 3. 
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C. Bias to The Flux Power Spectrum 



In panel (a) of Fig. [l]we show the ID flux power spectrum 

1 



Pffih) = j\s(h)\ (12) 



measured from simulations'^. In panels (b)-(d) we show the fractional correction due to magnification, calculated by 
weighting lines-of-sight by 1 + (5^. Magnification weighs lines of sight with large k more heavily than those with low k. 
Measurements of Sf at redshift z can be taken from quasars at redshift zq for z < zq < (1 + z)i>i3/i>a — 1- The upper 
limit is set by where the Lyman-/? line (A^ = 1026A) can be confused with Lyman-a. The lensing weight function is 
peaked at x ^ Xq/2, so for quasars at higher redshift, x(^) is closer to the peak of the lensing weight function making 
the lensing correction larger than it is for quasars just beyond z. 

Magnification causes a < 1% change to Pff with larger changes occurring at lower redshifts. The results shown in 
Fig. [T] assume s = 1 and that all quasars within one's sample are weighed equally. The actual lensing correction in 
realistic surveys could differ by factor of a few. The current statistical error on the amplitude of Pff, averaged over 
scales, is ^ 0.6% from SDSS [23], which is a bit larger than the magnification correction shown in Fig. [1] However, 
as precision improves, a systematic offset such as that introduced by lensing could well be relevant. 

It is instructive to get an analytic estimate for the bias to the flux power spectrum. Equation tells us the lensing 
bias for the flux power spectrum is: 

(P//obs(fc||)> - Pfftrucm) - ■^{6fik\\)6}{k\\)Sn (13) 

where Ax is the length of the quasar spectra from which the power is measured. It is therefore clear that this 
lensing bias depends on the three-point function or bispectrum. Expressing 6^ as a line-of-siglit integral of the mass 
fluctuation (Eq. ([lip ), and judiciously applying the Limber's approximation [28l. [29|. we obtain: 

{Pffobsik\i)) - Pfftrucikii) « ^H^n„,{5s - 2)^^^^x(l + 2)i?//,„(fc||, -fc||,0) . (14) 

where XQ is the distance to quasar, % is the average distance to the forest, and z is the corresponding redshift. Here, 
Bffm is the (true) ID flux-flux-mass bispectrum at redshift z. It should be emphasized that i3y/m(fc|| , — fcy , 0) does 
not vanish despite having one of its arguments (the one associated with mass) being zero. This is related to the fact 
that it arises from a ID projection of a 3D distribution. For instance, it is well known that a 3D power spectrum and 
its ID projection arc related by Pi£)(fc||) = /^.^^ {kdk /2tt) P^]j{k), and so Pid generally does not vanish in the fcjj 
limit. 

We calculate the correction to Pff using Bffm measured from simulations. Estimating Bffm{k\\,—k\\,Q) requires 
an extrapolation from what we actually measure Bffra{k\\ , — (fc|| + Afc), Afc), where the smallest Afc is the fundamental 
mode of the box. How this is done is described in detail in Appendix [Bl In Fig. [5] we compare the lensing correction 
to the flux power spectrum as determined via the bispectrum to that determined by the weighting method described 
previously in ijll Bl The qualitative agreement between the two is reassuring. 



D. Bias to The Effective Optical Depth 



The effective optical depth Tefi is estimated from data by averaging over frequencies (and quasars) the ratio of 
the observed flux /= to the continuum /c, i.e. e""^""' is estimated from f / fc- As emphasized in ijll Al lensing 
magnification affects / and fc in the same manner, and therefore leaves f / fc unchanged on a quasar by quasar basis. 
Lcnsing's impact enters through the sample selection, resulting in (see Eq. ([5])) 

(e-"-«obs> - e-^'-«t™e = {e-^^-''5^) (15) 



^ Our Fourier convention is <5(k) = JdPxe *'''^(5(x). 
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FIG. 2: The magnification correction to the effective opticai depth at three different mean redshifts z. This correction depends 
on the redshift zq of the quasars. In practice, measurements of Tch involves combining source quasars at multiple redshifts. In 
the above plot we assume 5s — 2 = 3. 



where e""^'*' on the right is the mean transmission on a line-of-sight by line-of-sight basis, and e""^"'* true is the true 
mean transmission if only we could dispense with the quasar-weighting and average over the whole sky equally. 

The correction to the cfTective optical depth measured from simulations is shown in Fig. [5] for quasars at several 
redshifts. The net effect of lensing will depend on how measurements from quasars at different redshifts are combined. 
However, Fig. O indicates the effect should be ~ 10"'^ — 10^**. Current measurements of Tes have errors of at least a 
few percent at each redshift so at present lensing should not be an issue.'* 

An analytical estimate of the lensing bias on the effective optical depth can be inferred from Eq. ([15]), which gives: 

- 1 = ^^f^'^^ ^ - 2)|ani?o'^^^x(l + ^)Pf^{H = 0) (16) 

e ""true ^ XQ 

where we have used Limber's approximation, Pjm(fc||) is the (true) ID flux- mass power spectrum evaluated at the 
mean redshift of absorption z, x is the corresponding distance, and xq is the quasar distance. The lensing bias of 
this one-point statistic is thus related to a two-point correlation, just as the lensing bias of the power spectrum is 
determined by the bispectrum. As before, to evaluate the bias using this method, an extrapolation of the simulation 
Pfm to fc|[ = is necessary (see Appendix[B]). We find results that are consistent with those obtained by the weighting 
method (Fig. to about 30%, with the latter generally higher. 



E. Bias to The Flux Probability Distribution Function 



Another interesting statistic of the Lyman-a forest is the flux probability distribution function (PDF): 'P{f)df 
describes the probability that the observed flux lies between / — df /2 and / -I- df /2. Here / is the continuum- 
normalized flux (i.e. f / fc from the previous section). 



* The error bars for measurements of T^g are dominated by uncertainties in the continuum- fit. Typically, the continuum is estimated 
either by extrapolating from the red side (e.g. |3C|| ) , or by performing a smooth fit through portions of the spectra that are deemed 
unabsorbed (e.g. [Ssll ). We simulate the effect of the latter by renormalizing the flux along each line of sight by /max, the maximum 
along that line of sight. At low redshifts, this introduces a negligible bias to T^g, but by 2: = 4, Tgg is biased low by ~ 8%, in rough 
agreement with |25|| . However, we have checked that the fractional magnification correction is not significantly changed whether we 
renormalize by /max or not. 
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FIG. 3: (a) The flux probability distribution as measured from the simulations. Panels (fe) - (d) show the fractional correction 
from magnification for mean redshifts z = 2,3, 4. The amplitude of the correction depends on the redshift of the quasar zq 
and the slope of the quasar number count function, here we assume 5s — 2 = 3 (Eq. ((6|). 



The flux PDF measured from simulations with bins of df = 0.01 is shown in panel (a) of Fig. |3l^ The fractional 
correction from lensing magnification is shown in panels (b)-{d) of the same figure. The effect of magnification is to 
boost the low flux (high absorption) end of the PDF and decrease the PDF at the high flux end. The correction is 
< 1%, so unimportant at the current level of precision (5 — 10% on the flux PDF in bins of df = 0.05 [3l| but see also 



^ We use the same fake continuum fitting procedure described in footnote \^ but again this appears to make little difference to the size of 
the fractional lensing correction. 
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It has been suggested that to avoid systematic errors from contimium fitting one should measure the PDF for 
fluctuations in the flux Sf = f/f— 1 rather than measuring the PDF of / [IHl- In this case the flux is normalized 
along each line of sight, so the range of Sf for one line-of-sight is —1 to 1// — 1 where / is the mean flux along that 
line of sight. The range of the entire PDF determined from many lines of sight will be —1 to l//min ~ 1, where /min 
is the mean flux along the line of sight with the lowest transmission. 

The PDF for ^/ is shown in panel (a) of Fig. 3] Again the fractional correction due to lensing is shown in panels 
{b)-{d). In this case the correction due to lensing appears rather large at high 5f. The high Sf bins are dominated 
by lines of sight with the lowest mean flux, which are also most highly magnified. However, these high Sf pixels are 
quite rare, i.e. the PDF at high Sf is quite small and therefore noisy from an observational point of view. 
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For an analytical estimate we apply the fundamental Eq. ([8]), 

{VifUs)-V{f)true^{V{f)S'^) (17) 

where V{f)df is estimated from each line of sight in the standard fashion: counting the fraction of pixels with a flux 
that falls within df of /. The flux is a non-linear function of the gas density, but for simplicity, we will assume that 
gas traces mass and that / is a local function of the mass fluctuation S, i.e. / = F{5), where the function F is to be 
specified. Let us use the symbol p to denote the (average) location of interest, i.e. Sp is the mass fluctuation at point 
p. Thus, the flux PDF at p can be expressed in terms of the mass PDF Vm{5p): 

V{f) = j d5pV^{6p) Snif - Fidp)) = J dSpdSi P„(<5p, Si) Snif - F{Sp)) (18) 

where Sd is the Dirac delta function. For the second equality, we have introduced Si which is the mass fluctuation 
at some other point (the subscript I is used in anticipation of the fact that this will be where some lens is), and 
'Pm{Sp,Si) is the joint mass PDF at the two points, i.e. the one-point PDF Vm.{Sp) is related to the two-point PDF 
by 'Pm{Sp) = / dSiPm{5p, Si). The motivation for introducing the two-point mass PDF is to ease the computation of 
{V{f)S'^), where involves a line-of-sight integral over locations that generally differ from p. We obtain: ^ 

{ViDS") = -H^n^{5s - 2) / dxi^^^»(l + zi) / dSpdSi 5iVrn{Sp,Si)5D{f - F{Sp)) . (19) 
^ Jo XQ J 

We need a model for the two-point mass PDF. In linear theory, that is completely specified by the two-point correlation 
(SpSi). The relevant fluctuations in the forest arc not quite linear. Instead, we will adopt the lognormal model (see 
for example [36l Ist}): 

VmiSp, Si) = , ^ A (20) 

' 2TrVd^{l + Sp)il+ Si) 

where A is a 2-component vector with A, = ln(l -I- Si) + ^111(1 + (Sf)), and C is a 2 x 2 matrix with components 
Cij = hi{l + {SiSj)). Here i and j stand for p ov I. The lognormal model is simply one in which A is a Gaussian random 
field, and the observed S is related to it by 1 + (5 = cxp [A - (A^)/2], such that (A^) = ln(l + ((5^)). Substituting 
Eq. ([20]) into Eq. ([T9| and integrating over Sp and Si, we find 

^^^%^-^^^ = ^f]„(5s-2)i7o' r'd;^,^Q^x;(l + ^0(e^'''/^-'"^^+**)+^('^'''-^'Vc-»-l) , (21) 

>'(/)true 2 Jo XQ ^ ^ 

where 5* = F~^{f) is the actual density such that the observed flux equals the value of interest /. At this point, we 
need to specify F: We use F{S) = exp [-A(l + S)f^], with /3 = 1.58 and A = 0.2010, 0.9578, 2.960 at Zp = 2, 3, 4 
respectively. This approximates well what is in our simulations, though the simulated spectra were computed using 
the exact relation between the optical depth, baryon density, temperature and ionizing background. 

To use Eq. (j2ip . we need Cip and Cpp, which we compute using the nonlinear mass power spectrum, with suitable 
smoothing (for p) to account for the effective Jeans smoothing of baryons. The cosmology and power spectrum 
prescriptions arc described at the end of ^11 Al Figure [5] compares the analytic calculation in Eq. ([21]) with the results 
from simulations in i jllBI The two methods agree remarkably well considering the simplicity of the analytic method. 
If the linear mass PDF were used, the calculated correction has a slightly different /-dependence and is typically 
smaller by about a factor of 5 — 6. 

We should mention data that do not resolve the Jeans scale (or roughly, the thermal broadening scale) have an 
additional complication: even if the fundamental mass density is lognormal distributed (in both one-point and two- 
point sense), the flux field smoothed with a coarse resolution might not be well described by our model. In other 
words, smoothing and nonlinear transformation do not commute. In any case, the lensing effect on the flux PDF 
appears to be fairly small. 



Here, we arc abusing the notation a bit. On the left, ■P(/) strictly speaking denotes a stochastic quantity; it should be the estimator 
that simply counts the fraction of relevant pixels, whereas on the right, Vmi^, <5') denotes the true (non-stochastic) two-point mass PDF. 
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FIG. 5: Left: A comparison of the predicted lensing bias to the flux power spectrum using the weighted calculation presented in 
^11 Bl (solid lines) versus using Eq. (jldp with the flux- flux- mass bispectrum measured from simulations (points). From bottom 
to top the quasar redshifts are 3.1, 3.2, 3.4, 3.6. Right panel: The correction to the flux PDF from the analytic model described 
in Eq. (|2ip (solid lines) compared with the results from simulations (points). The analytic description with the log-normal 
model for the mass PDF works surprisingly well. 



III. LENSING AS SIGNAL: A TEST OF HOW NEUTRAL HYDROGEN TRACES MASS 

In the previous section, we have been exploring lensing as a source of measurement bias. Here, we explore the 
converse: lensing as a useful signal. We are interested in ways to extract signals of gravitational lensing from 
Lyman-alpha forest observations and thereby constrain the cross-correlations between flux and mass (for example, 
the flux-mass power spectrum and flux- flux- mass bispectrum present in Eq. (jl4p and Eq. (|16p). 

One option is to correlate the magnitude of quasars with Lyman-alpha forest observables (wc will consider other 
possible cross correlations at the end of this section). Given some observable Oj measured from a quasar labeled /, 
and its magnitude to/, we can form an estimator £: 




where iVqso is the number of quasars in one's sample. As is explained in mi Al the Lyman-alpha forest observable is 
always implicitly weighted by the number density of quasars (determined by the sample selection). And the quasar 
magnitude is of course modified by lensing, following from Eq. (fT|): 

,5m = -5K/lnlO. (23) 

It is shown in Appendix \K\ that 

{£) = 5S{kSO) (24) 

where dO represents fluctuations in O and s is defined as 

1 ^ dm e{m){dn^ / dm){m — fh) ^^^^ 
In 10 ^ dme{m)vP{rri) 

where to is the average magnitude in the sample i.e. to = J dm me(m)n'^ (m) / J dme{m)n^{m), and ^''(to) and i{m) 
are the luminosity function and selection function respectively — the definition for s is chosen to resemble those for 
s and s' (Eqs. [5], HH])- The s defined here is related to Cs defined in Q by Cs = —In 10 s. As first emphasized by 
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0, Cs = for a strictly power-law luminosity function in the sense of n°{m) oc e™. The realistic quasar luminosity 
function is not of this form, and we will adopt Cs ~ 1/3 from or equivalently s ~ —0.14, for our numerical 
estimates below. The normalizing factor in Eq. ([24|) is therefore 5s ~ —0.7. It is worth emphasizing that there is 
no '-2' term here, unlike for instance 5s — 2 in Eq. ([5]). The '-2' there arises from the geometrical increase in area 
by magnification. The statistic £ under consideration is immune to this effect. Further discussions can be found in 
Appendix |^ 

A. Magnitude-flux Correlation 

Let us first consider cross-correlations between O = 6f, the intervening flux fluctuation, and the quasar magnitude 
TO, giving 

{£5m5AXQ,X)) = 5s~(k<5/) « ^ij2f7^55^Q^^(l + z)Pfrn{k\l = 0) (26) 

^ XQ 

where XQ is the distance to quasar, x is the distance to absoprtion (i.e. where Sf is located) and z is the corresponding 
rcdshift. Here, Pfm is the (true) ID flux-mass power spectrum — the subscript to of Pfm stands for mass rather than 
magnitude (just as in ^IIPp . 

The nice thing about the estimator £sm Sj is that it is expected to depend on the location of absorption x in a 
predictable way, which allows this to be separated from other possible systematic effects, an example of which is large 
scale power from the uncertain continuum (shape). The fact that the amplitude of this cross correlation signal scales 
with s can also be exploited to test for consistency, for instance by isolating different subsamples of quasars with 
different values of s. 

In Fig. [6]we show the expected signal for this cross-correlation as a function of distance to the quasar xq ~ Xi where 
X is the distance to the absorption. To estimate the overall statistical significance, we can combine the measurements 
at different separations into an estimate for a single amplitude, namely Pfm{k\\ = 0). The appropriate minimum 
variance estimator is described in Appendix [C] where we also derive its signal-to- noise: 

N ^ {5m^) J 2ti Pff[k\\)+S.M. ^ ' 

where Esm5f{k\\) is the Fourier transform (over x) of Eq. (|26p . Nq^q is the number of quasars available, {Sm'^) is the 
quasar magnitude variance, Pf f is the ID flux power spectrum, and S.Af. is the associated shot-noise. 

The S.Af. term is important so we consider the signal-to-noise per quasar for two survey configurations: 'SDSS III 
configuration' with resolution FWHM is 60 km/s, and the shot-noise power is ^ 0.44 Mpc/h (3/(1 + z))3/2 [3| and 
'Keck configuration' with resolution FWHM is 10 km/s and shot-noise power is ^ 0.029 Mpc/h (3/(1 + z))3/2. For 
both we take the magnitude dispersion of the quasar sample to be 0.5 [6|. The first set of numbers roughly resemble 
the expectations of SDSS HI l39| but keep in mind the configuration we assume likely differs a bit from what will 
turn out in practice. The results for the S/N per sight-line are summarized in Table I. For the SDSS HI expectation 
of Nq = 160, 000 the S/N for this amplitude is expected to be only ~ 1 and even for a futuristic survey like BigBOSS 
[40| that could measure 10^ spectra with similar resolution only S/N ~ 2 — 3 could be achieved. Comparison with the 
'Keck configuration' shows that shot noise is clearly a limitation for these observables. However, the S/N estimates 
presented arc for a single rcdshift bin of width Az 0.2, but measurements will be made at a range of redshifts which 
could be combined to get a slightly higher signal-to-noise estimate of, for example, the mean Pfm across the redshift 
span of the survey. 

It is worth noting a few points that reduce S/N of the forest-magnitude correlations in comparison with galaxy- 
magnitude correlations used @. (1) The cross-correlation between flux and mass is lower than the correlation between 
galaxies and mass (the correlation coefficient is ~ Pfm / \/Pfj Pmm < 0.5). (2) Lensing peaks at a distance halfway 
between the observer and the quasar, but to avoid confusion with Lyman-/? we use only the part of the Lyman-a forest 
near to the quasar where the lensing is smaller. (3) The signal-to-noise of magnitude-forest correlations is (x \/Nq 
where Nq is the number of quasars with spectra, while for the galaxy-magnitude cross-correlation it is proportional to 
the number of galaxy-quasar pairs oc ^jNqNg (however in ijHIDI we briefly discuss correlating the forest with quasars 
along different lines of sight). 
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FIG. 6: Left: The lensing-induced correlation between quasar magnitude and tlie flux power spectrum given in Eq. (|28|l . 
The amplitude of the signal increases with increasing distance between x{^) s-nd the background quasar but above we assume 
the correlation is averaged over background quasars at redshifts between z + 0.1 and zp = Vi3/ua{l + z) — 1 (the maximum 
redshift for the quasar before confusion between Lyman-a and Lyman- /3 absorption sets in). Right: The magnitude-flux cross 
correlation (Eq. I26|) as a function of line-of-sight comoving distance to the quasar. 



Observable: 


S/N per l.o.s. for SDSS III configuration 


S/N per l.o.s. for Keck configuration 


(SfSm) at z = 2,3,4 


0.002,0.0029,0.0022 


0.0041,0.0036,0.0024 


(SPffSm) at z = 2,3,4 


0.0016,0.0028,0.0025 


0.008, 0.006, 0.0042 


if/fcSm) at z^ 2,3, 4 


0.0054, 0.0032, 0.00095 





TABLE I: The signal-to-noise per sightlinc in redshift bins of Az ~ 0.2 for the proposed estimators in Eq. (|26[l . (|29|) and pO|) 
for the SDSS III and Keck survey configurations (see ijlll A|) . Here we assume the quasar sample has s = —0.14 (see Eq. (|25p ) 
and the quasar magnitude dispersion is 0.5. For (f/fcSm) we assume the same fractional errors of f / fc as [S^l and that the 
errors are statistics limited so they scale as 1/ ^JNqso- 



B. Magnitude-Power Correlation 



Another possibility is to set O — Pff{k\\) in the estimator Eq. (p2)) . i.e. cross correlate quasar magnitude and the 
flux power spectrum. This estimator has the following expectation value: 

(f5™p„(fc||)) = |^(K<5/(A:||)<5}(fc||)) (28) 

where Ax is the length of the quasar spectra from which the power is measured. This expression is similar to Eq. 
p3)) . and indeed one can relate this to the bias in P// measurement we have calculated in ijllCI (Eq. [T4]): 

(£5,nP,, (fell)) = 7^((P//obs(fc||)) - P//truc(fc||)) . (29) 

This means that one could in principle use the magnitude-power correlation to correct for the bias in Pf f measurement. 
The expected magnitude-power correlation is shown in Fig. |6] and the signal-to- noise is given in Table I. 
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C. Magnitude- Mean- Transmission Correlation 



If the continuum can be accurately estimated, one could also correlate the mean transmission from each line of sight 
with the quasar magnitude behind it, i.e. choose O = [f / fc] where / is the observed flux and fc is the continuum, 
and the brackets [ ] denote an average along the line of sight. One can see that the expected correlation should be 
related to the bias in e""^"'* discussed in ^IIDI 

i^Srn [f/fc]) = ^7^(<^"'°"°bs> " e^^-t.^e) (30) 

The signal-to-noise per sight line is listed in Table I. Note that the number of quasars needed is realistic only if 
the continuum is estimated by extrapolating from the red side (such as in the analysis of [soj). The number of 
quasars should be much smaller if the continuum were estimated by performing a smooth fit through portions of the 
Lyman-alpha forest that are deemed unabsorbed, a method that typically requires high resolution data. Whichever 
the continuum estimation method, care should be taken in accounting for possible systematic biases (in the continuum 
estimate) when interpreting this cross correlation measurement. 



D. Additional Cross Correlations 



We have focused on cross correlations between the quasar magnitude and a Lyman-alpha forest observable in the 
same line of sight. There are two possible extensions we should mention in passing. One is that the quasar magnitude 
and the Lyman-alpha forest observable can come from different lines of sight. In other words, the cross correlations 
can be measured at a non-zero angular separation. On the one hand, the forest-magnitude correlation should drop 
off rapidly with increasing angular separation decreasing the signal, however the noise is reduced by the number of 
quasar pairs at a given angular separation. Roughly, the signal-to-noise for lines of sight at angular separation 9 
should scale as 



5^ ^ fS\ lO(XQ-X) Wf„M |Npa^rs{0) ^^^^ 



^/atsep.e V^/same l.o.s. XQ ^/^(fl = 0) y Nq 

where Wfm{d) is the angular flux- mass correlation function, Npairsi^) is the number of quasar pairs with angular 
separation 6 and the term with the ratio of the distances roughly accounts for the fact that this correlation doesn't 
require z to be above the Lyman-/? confusion limit for the quasar (previously we needed X > Xp 4/5xq and had set 
X ~ X0 + iXQ ~ X/3)/2)- It seems that the sparseness of quasars 16/(deg.)^ for SDSS III) does not permit Npairs{6) 
to be large enough that adding off-axis correlations will drastically improve the signal-to-noise in the near term. 

Another possibility is to cross correlate the quasar number density (which is affected by Icnsing through magnifica- 
tion bias) with the Lyman-alpha forest observable. Such a correlation makes sense only at a non-zero lag (or non-zero 
smoothing), since the Lyman-alpha forest is observable only if there is a quasar directly behind it. While we do not 
give explicit estimates here of the signal-to-noise for these cross correlations, they should be useful in disentangling 
the lensing signal from certain systematic effects, as we will discuss next. 



E. Dust and Other Systematic Effects 



We discuss here three systematic effects that could complicate the measurement of the various cross correlations 
mentioned above. 

The first is the continuum. The continuum presumably is smooth and therefore has fluctuations only on large scales. 
But since its precise shape is uncertain, a cross correlation such as the magnitude-flux correlation is susceptible to 
possible contamination from continuum power. A fortunate feature of the cross correlation is that it has a definite 
shape predicted by Icnsing, as well as an amplitude that scales with .5. Both can be exploited to check for such a 
contamination. 

The second systematic effect we loosely refer to as 'background subtraction'. Realistic spectra of quasars inevitably 
contain 'background' which can come from several sources, including the sky and scattered light within the optical 
instrument. While attempts are generally made to subtract these backgrounds as accurately as possible, there are 
inevitably residuals. These residuals could correlate with the quasar magnitude, for instance they could be more 
noticeable for fainter quasars. This would then produce spurious correlations when we correlate the quasar magnitude 
with Lyman-alpha forest observables (deduced from imperfect data that contain residual backgrounds). In fact. 
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existing flux power spectrum measurements from SDSS (27[ are known to exhibit an otherwise puzzhng correlation: 
that Pff is systematically higher for fainter quasars, and this correlation is statistically significant (the normalized 
correlation coefficient is '-^ 5— 10% [4l|). Such a correlation can be explained by this background subtraction effect. It 
cannot be explained by lensing since it tends to produce a correlation of an opposite sign unless s has a sign opposite 
to what is known. (It can also be plausibly produced by dust which we will discuss below) . To disentangle background 
subtraction issues from lensing, it would be useful to examine magnitude-observable cross correlations at non-zero lag 
- it would be quite surprising if background residuals cause correlations between quasar magnitude at one point with 
Lyman-alpha forest power at another. 

The third systematic effect is dust extinction. Dust modifies flux by / /e^'^''"=% where Tdust is the optical depth 
due to dust. This modification is frequency dependent (higher optical depth for bluer photons), but the frequency 
(which translates into scale) dependence is mild on the scales of interest, and therefore effectively acts as a continuum. 
The net magnification plus dust correction to the quasar number density is 

n ^ n/x2-5^-ie-2-^"^<'-' . (32) 

Including dust extinction, the measurement bias associated with a Lyman-alpha forest observable (Eq. [5]) is changed 
to 

(Oobs) - Otruc - (5S - 2){Ok) - 2.5s{06Tdnst) (33) 

where (Jrdust = Tdust — (Tdust), which is assumed to be small, and we have ignored (OnSTdust) ■ 

As far as the 'signal' part of our discussion is concerned, the magnitude-observable cross correlation (Eq. [M]) is 
modified to 

{£) = 5S{kSO) - 2.5s((5Tdust'50) ■ (34) 

A full calculation of the effect of dust is beyond the scope of this paper. But given a model for (5rdust the effect 
can be calculated in a way very similar to lensing - Tdust after all is another line of sight integral over the (dust) 
density. Calculations comparing the amplitudes of magnification and dust corrections to supernova fiux have shown 
(K(5Tdust) — ('^Tdust'^Tdust) to bc ^ 7 — 40% of (kk) at Zsource = 1-5 depending on the dust model At higher 

redshifts the dust term is expected to be less important, while the lensing effect should grow. On the other hand, 
recent measurements suggest that at z = 0.3 dust extinction can cause shifts in source magnitude comparable to those 
caused by lensing magnification For the Lyman-a forest, which is typically measured at redshifts z > 2, the dust 
corrections are likely to be smaller than the lensing corrections but should be included in a full analysis. 



IV. DISCUSSION 



We have discussed the sampling bias introduced by magnification and dust on measurements of the Lyman-a forest. 
In calculating the effect of this bias (summarized in Eq. ([5])) we made the assumption that all quasar spectra are 
weighted equally. If, on the other hand, forest measurements were weighted by quasar fiux, the effect of lensing bias 
would be larger than what we have found (see for example Eq. ©). 

If the quasars are weighted uniformly, we find that magnification bias leads to corrections < 1% to the flux power 
spectrum (Fig. [T]), < 0.1% to the effective optical depth (Fig. [5]), < 0.1% to the flux probability distribution function 
P{f), and as large as a few % to the probability distribution function for the flux fluctuation 'P{5f) (see Fig. |3]and 
Fig. 2]). These estimates assume quasar number count slope s = 1, probably reasonable for the SDSS quasars used 
for Lyman-a forest measurements. The lensing correction varies strongly with redshift with larger corrections present 
at lower redshift, largely due to non-linear growth of mass fluctuations. The biases induced by magniflcation to the 
effective optical depth are signiflcantly smaller than current error bars. For the flux power spectrum, the lensing 
bias is just within the current error bars, and since it leads to a systematic offset in each data point, may effect 
measurements of the overall amplitude of the power spectrum. At low redshift the lensing effect on the PDF of (5/ can 
be rather large, reaching several percent at the high 5f end of the PDF at z = 2, however this occurs only in regions 
where the PDF itself is very small < 0.1. Lensing magnification is probably unimportant for current measurements 
of the flux PDF and the quantities derived from it. 

One may wonder whether including nonlinear magnification^ - which is more important at the small angular 
separations relevant for the Lyman-a forest - could change our results. We have checked that including all k terms 



^ To be precise, the true magnification ^ = 1/((1 — ft)^ ~ "I't ~ ^1) where 71 and 72 are the shear. Throughout this paper we assume 
, 71 , 72 are small quantities and so we keep only the first order terms in the expression for ^. Non-linear magnification refers to the 
higher-order (in 7 and k) terms contributing to ^ O . 
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contributing to in Eq. ([2]) and Eq. pT|) . rather than just the hnear term as in Eq. changes the magnification 
bias corrections by < 2%. The true nonlinear magnification of course depends on the components of the shear as well 
as the lensing convergence but it would be surprising if they drastically changed our (all-orders in k) estimate of their 
importance* . 

An additional caveat to our analysis is that the damped Lyman-a (DLA) systems and their associated damping 
wings are not accurately modeled by the simulations and mock spectra. While DLAs are rare, magnification bias 
could make them more abundant in survey samples. The damping wings of DLAs arc known to add spurious power to 
P//(fc||) at the 10 — 20% level on large scales [44|. Magnification bias should increase the abundance of DLAs making 
the power spectrum more biased than our results suggest. Precisely how magnification affects the DLA bias deserves 
further investigation. 

Perhaps most interesting is that lensing magnification induces a correlation between Lyman-a observables and the 
magnitude of the quasars used to measure them (Fig. [5]). Unfortunately even with the large number of quasar spectra 
the BOSS survey will obtain it will be a challenge to detect correlations between the flux power spectrum, the flux 
decrement or the mean transmission and quasar magnitude with high signal-to-noise (see Table I). Additionally, it 
is possible that lines of sight with high magnification also have more metal lines which could further complicate the 
analysis. Nevertheless these correlations provide a direct measure of how fluctuations in the quasar flux trace the 
underlying density field, for example they could directly constrain the flux-mass power spectrum and should therefore 
be targeted. A more thorough analysis of how to detect the flux-magnitude and flux-power correlations is necessary, 
we leave this to future work. For a recent idea in a similar vein see [Tsl . [l^ who propose correlating lensing in the 
cosmic microwave background with fluctuations in the forest to extract flux-mass information. 

It is worth noting that, with a large quasar sample, it may be possible to exploit the dependence on 5s — 2 and/or 
x(zq) to isolate the magnification correction and to measure the flux- magnitude correlations. The lensing correction 
depends linearly on 5s — 2, which ranges quite a bit depending on magnitude limit (Fig. [7]). In the weak lensing 
limit the correction scales as Ho{x{zq) — x{z))x{z) / x{^q) which varies from ^ ^Hqx{zq) to as z goes from zp and 
zq. Indeed measuring these distinctive dependencies would help guard against possible instrumental systematics from 
being confused with the lensing signal. Our estimates of the forest-magnitude correlations in §111 AftTlI Dl assumed z 
was halfway between zp and zq. Also, there exist in public data ~ 150 close quasar pairs [l^, that one might wish to 
use for this analysis, unfortunately their number is small compared to the 10^ expected from SDSS III and will not 
significantly improve the signal-to-noise. 

There is quite a bit of interest in using the Lyman-a forest to map the 3D density field and use, for example the 
baryon features in the two-point correlation function or power spectrum to constrain dark energy and the expansion 
history of the universe [46j . While we have focused on the power spectrum gotten from ID measurements of the fiux 
fluctuation, the analysis could be extended to include correlations between different quasar spectra (and therefore 
different sight-lines). Additional corrections due to correlations between flux and convergence across different lines 
of sight (^ {Sf{xd)Sf{x'6')Kix^))) would arise. However, since lensing is strongest for lenses along the same line of 
sight as the sources, the terms calculated in this work should be dominant. One would therefore expect the lensing 
bias to the correlation function around the baryon scale to remain 0.1 — 1% ^. 
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Actually, there is a further caveat here in that we didn't compute the true lensing convergence k because we have ignored contributions 
from mass fluctuations at the lower redshifts outside the box. For the non-linear terms, this shouldn't be a good approximation (e.g. 
(kk) is poorly estimated) nevertheless it would be surprising if this made an orders-of-magnitude difference in the effect of magnification 
bias. But perhaps this plus the ignored 7 terms could increase the importance of non-linear magnification to the ~ 20% reported by 

a- 

We have checked that unlike the case of the 3D galaxy correlation function l47t l48f nothing is remarkably different about the lensing 
bias to the real space flux correlation function as compared to Fourier space. 
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(a) (b) 

FIG. 7: (a) A histogram of the number of quasars as function of i-band magnitude measured from the Sloan Digital Sky Survey. 
(6) The slopes s (solid) and s' (dashed) defined in Eqs. ([2} and (|10|l as a function of the limiting magnitude (i.e. assuming 
a step function e). These measurements ignore possible incompleteness in the data, and are therefore likely underestimates, 
especially at the faint end. 



Appendix A: Derivation of Magnitude, Number and Observable Cross-correlations 

In this appendix, we derive the main results of this paper in a unified manner. They inelude the lensing induced 
measurement bias (for both uniform and flux weighting) and the general magnitude-observable eross-eorrelation, 
described respectively in Eqs. ©J (|H1), (HI & Most of these reduce to known results in the literature for a 

step-function selection. 

All quantities of interest take the following form: 

Q = (Al) 

where / sums over quasars in one's sample, wj and u/ are explicit weights one might apply to them {wj and uj might 
or might not be equal), and Oj is a Lyman-alpha forest observable associated with quasar /. 

It is helpful following ijll Al to conceptually pixelize the survey (with pixel label i), and rewrite this as 

/ dmu{m)e{m)ni{m) 

where dmni(m) is the number density of quasars at pixel i with magnitude m±dm/2, e(m) is the selection function 
(e.g. e(m) could be 1 for all quasars brighter than some limit miim. and otherwise), and we assume the weights w 
and u are functions of the observed quasar magnitude m. 

Recall that lensing modifies the observe flux by / ^ / + (5/ = //i = /(I + 2k), where k is the (weak) lensing 
convergence defined previously in Eq. ([6]). Flux and magnitude are related by / = exp [—to In 10/2.5], and so 
TO = mP + 5m = mP — j^^k, where mP and to are the unlensed and lensed magnitudes respectively (Eq. |23|). The 
observed number density of quasars ni{m)dm is related to the pre-lensed number density n'^{mP)dmP by 

n,{m)dm = n°{m°)dm°Y^-^ (A3) 
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where 1/(1 + 2Ki) factor accounts for the geometrical increase in area by magnification (and therefore reduction in 
number density). Taylor expanding gives us 

n,(m) = n^m) + [^^^ " (A4) 
The denominator of Q is well approximated by 

denominator of Q = / dmu{m)e[rn)'nP{m) (A5) 

where we have assumed that the survey is large enough that under the summation over pixels i, rLf{m) which could 
fluctuate from pixel to pixel can be replaced by its average •nP{ni) (i.e. the true pre-lensed luminosity function), and 
Ki ^ Q. The summation then simply gives the number of pixels in the survey. This kind of approximation is 
equivalent to ignoring corrections of the integral constraint type |15l |. 

The numerator of Q can be computed by rewriting Oi = Otrue + 50i, where Otruc denotes the ensemble average 
Ctrue = {Oi) (i.e. an average over ensemble of realizations of the universe). Assuming there is neither correlation 
between Oi and the pre-lensed number density n^, nor correlation between and n^, we find 

(numerator of Q) = / dm ■u;(m)e(m)n°(m)C'truo + / dmw{m)e{m) f ,^'-f„ ^"/"^^ — n"(m)^ 2(kSO) (A6) 
^-^ J ^-^ J V In 10 dm I 

i i ^ ^ 

where we have dropped the i label from {niSOi) since this is simply a cross-correlation at zero-lag. Just as for the 
denominator, the summation reduces simply to the total number of pixels. 
In summary, we find 

, , Umw{m)e{m)n'^{m) , , / dmw{m)t{m) ( j^fg ''""^"^ - n°(?7i)) 

(Q) = Otrue ^V:^ + 2(«'50> } .Of ^ (A7) 

J dmu{m)t[m)'n}'(ra) j amu[rn)e[m)rv'[rn) 

This is a fundamental result from which everything follows. Let's first apply it to the case with w{m) = u{m) = 1 i.e. 
Q is what we have been calling Cobs, with equal weights applied to all quasars within one's sample. Eq. (|A7P tells us 



, 1 ( dm eira^idnP I dm) ,. , 

Oobs =Otruc+ 5s-2 , s= /\ ' ' , AS 

mlO j dme[m)rv-'[m) 

consistent with Eq. ([8|) and the definition of s in Eq. ([2]). Note that {k50) = (kO) since (k) = 0. If the selection 
function e(m) were a step-function, s reduces to the more familiar slope of the quasar number count upon integration 
by parts (see f|T|. 

Let's next try w(m) ~ u{m) = exp [—m In 10/2.5] in Eq. (jA7[) . i.e. Q is equivalent to Cobs; with a flux weighting. 
It is simple to see that 

in \ n ^(^' 9V An\ ' ^ / dme(m)(dn7dm)10-"/2.5 

in 10 J dm e{m)n'^ {m)10 a 

which is consistent with Eqs. ([9]) & ()10p . 

Lastly, let's use w{m) = m — m and u[m) = 1 in Eq. (jA7p . with fh = J dm m e(m)rfi (m) / J dm e{m)n^ (m) . 
Then, Q corresponds to the cross-correlation estimator 8 of Eq. (|22]) , keeping in mind that m (which is the average 
sample magnitude defined by the true unlensed luminosity function) should be well approximated by mj / for 
a sufficiently large survey. We obtain: 

ic\ K~/ xn\ ~- 1 J dme{m){dn"/dm){m - fh) 

{£)=5s{KdO) , S^-— jr- , , „, , (AlO) 

mlO j dm€[m)n'^[m) 

which is consistent with Eqs. ([^ & (^5]) . Note how the '—2' that is present in both Eqs. (|A8|) & (|A9p is absent in 
(IA10[) . In the first two cases, this '—2' originates from the '— ri,°(m)' in the second term of Eq. (jA7p . This term yields 
zero in Eq. (jAlOP by definition of w{m) = m — fh. 

The different symbols s, s' and s can be seen as different (normalized) moments of e{m)dn^ / dm. In Fig. [7] we show 
the cumulated number counts and rough estimates of s and s' from SDSS data release 6. It is worth emphasizing 
that incompleteness is not taken into account in deducing these estimates, and the true values of s and s' could 
well be higher, especially at the faint end. In other words, if incompleteness is present, it should be properly taken 
into account through the efficiency e(m). Blindly measuring s by taking derivative of the observed (i.e. incomplete) 
cumulated number count would result in an underestimate. 
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FIG. 8: The flux-flux-mass bispectrum (points) compared with the hierarchical model given in Eq. (IBlf) (solid lines) at redshift 
3. The simulations box-size L = 33.75 Mpc/h. 



Appendix B: Determining the Low-fc Bispectrum 

In this Appendix wc discuss how we determine the flux- flux- mass bispectrum at low-A:||. The issue is that the 
simulations have a finite box size and we need to extrapolate _B//,„(fc||, — (fc|| + fc||),fc||) from fc'i 27r/L to fc|| ^ 0, 
where L is the size of the box. At z = 3 we have simulations in a larger box and we find that doubling the box size 
(from L — 50 Mpc/h to L = 100 Mpc/h) increases i?//„i(fc||, — fc||, 0) by a factor of about four. To determine Bffm 
for very low k we first use a hierarchical model for Bffm, 

+fc||),fc||) =Q//m(/'/rn(fc||)F/™(fcf|)+P/,„(fc||)P/™(fc|| +fc^^ . (Bl) 

This hierarchical model with Q f f„i independent of k\ \ but varying with z appears to be a pretty good fit to the 
simulations-measured bispectrum for /c||,/C|| < Ih/Mpc with better agreement z — i and z = 4 than z = 2. At 
fc|| > 1 h/Mpc the hierarchical bispectrum drops off much more rapidly than the true bispectrum. Comparison of the 
hierarchical and true bispectra is shown in Fig. [51 Given the agreement between the hierarchical and the measured 
bispectra (on large scales), we now assume Eq. (|B1[) works at Bffm{k\\, 0) and embark on the somewhat easier 
task of determining Pfm at very low fc||. 

The low fc|| value of Pfm is determined using both the simulations and analytics. Precisely, we set Pfm{k = 0) = 
Pfm''{kmin)/Pbm^'{kmin) X Pbm'^' {^^ — 0)- The Calculated 1-D baryon-mass power spectrum at fc|| = is given by 

n™'^- = / ^PMk±)e-'-/'' (B2) 

where PzD{k) is the 3-D mass power spectrum, k^ = \J^kj with kj = ^JAtyGp/ {cs{1 + z)) is the Jeans length for a 
system with sounds speed Cg [i^ and proper mass density p. Assuming the intergalactic medium can be treated as a 
monatomic ideal gas Cs = y^5T/{3m) where m is the mean particle mass. For a fully ionized gas composed of 75% 
Hydrogen and 25% Helium by mass m = 0.59mp where nip is the proton mass. We assume T = 20, OOOii' is the gas 
temperature constant over the redshift range we are interested in [50[ . 

The reader might wonder whether analytic estimates of the flux-mass power spectrum and flux-flux-mass bispectrum 
can be used to make accurate predictions without simulations. A naive guess, Sf = bSf,{k) works fairly well for Pfm 
but overestimates the magnitude of Bffm by a factor of a few. The dominant reason for this discrepancy appears 
to be the difference between Qffm and Qmmm, where Qmmm is the equivalent parameter for the hierarchical mass- 
mass-mass bispectrum. Hyper-extended perturbation theory [5l| suggests that on the scales we are interested in. 
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the hierarchical ansatz should fit the mass bispectrum Bmmm with a factor Qmmm ~ 3 (and only weakly dependent 
on redshift). Indeed our simulations give a similar value for Qmmmi ^md the hierarchical ansatz fits Bmmm rather 
well. However, for the flux-flux-mass bispectrum we find that Qf/m < 1 and varies more strongly with redshift. So, 
because of this difference the assumption Bffm{k,k',k") = b^Bmmm fails even on large-scales where P// ~ b^Pmm- 
This is why we have to rely on simulations to obtain an estimate of the flux-flux-mass bispectrum, but making suitable 
corrections to get at the (fc||,— fc||,0) limit. It is also worth emphasizing we have estimated the measurement bias 
associated with the flux power spectrum two different ways, one using the flux-flux-mass bispectrum and the other 
doesn't, and they agree reasonably well (Fig. O. 



Appendix C: Estimator and Error for the Magnitude-Flux Correlation Amplitude 



Recall that the expected magnitude-flux correlation from Eq. takes the following form 

{£sr,iSf{x)) = 9{x)A 



(CI) 



where g{x) is a function of x (distance to absorption), and A represents the amplitude of this correlation, which one 
can equate with Pfm{k\\ = 0) if one wishes (the precise choice will have no bearing on the final signal-to- noise of 
interest). We have suppressed the dependence on XQ (distance to quasar). 

Our goal is to come up with an estimator for this amplitude A by combining the magnitude-flux correlations at 
different scales (x's), and show that its error bar is given by Eq. (P7)) . The estimator takes the form: 



SA = Y.'^'{x)Sm'Sj{x) 



(C2) 



where the / index labels the quasar, and the sum over x ranges over the (binned) scales of Lyman-alpha forest 
observations. Here, Sm^ — nrJ — m, with fh being the mean sample magnitude (see discussion around Eq. |A10| ). 
and (x) represents a weighting of the data over quasars and absorption locations which remains to be specified. To 
satisfy {8a) = A, we would want {x) to satisfy 



J2w'{x)9ix) = l 



The variance of this estimator is given by 



E ^'ixWix') [{Sm'Sjix)Sm-'Sj{x')) - {Sm' Sjix)){6m'' 6j {x'))] 



(C3) 



(C4) 



Assuming the terms in [ ] are dominated by Sij{6rn^){Sf{x)Sf{x')) (i-e. correlations between different quasars are 
weak, and that cross correlations between magnitude and Sj are also weak compared with auto correlations), we find 



(Sm^) w\x)^ffix,x'Wix') 



(C5) 



where we have used ^//(XiX') to represent {Sf{x)^f{x')) ^ it should be kept in mind that this should include con- 
tributions from both the intrinsic forest fluctuations and shot-noise. The shot-noise contribution makes strictly 
speaking a function of the quasar index /, but we will ignore it for our purpose of a crude S/N estimate. 

Standard minimization technique applied to Eq. (|C5I) subject to the constraint Eq. (jCSp gives us the minimum 
variance weighting: 



w'ix) 



Y.^-f}{x,x')9{x!) 



I 



E 3(X')C7/(^''X")9(X") 
J.x'.x" 



(C6) 



where t,^^ is defined to be the matrix inverse of ^// i.e. ^ffiX:X')iffix' tX") = ^x,x"- With this optimal 

weighting, the variance is given by 



N, 



QSO 



E 9ix%}ix',x")g{x") 

x',x" 



(C7) 
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where iVqso is the number of quasars. 
The signal-to-noise of interest is 



which after some Fourier manipulations is equivalent to Eq. (j27p . 

Let us close by noting that in the same spirit and notation, the magnitude-flux correlation (at a given x) ha-s a 
variance of 

^''^'hffix,x) (C9) 



QSO 
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